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(54) Title: RECOMBINANT ALVEOLAR SURFACTANT PROTEIN 



(57) Abstract 



The complete coding sequences and 
amino acid sequences for both canine and 
human 10K alveolar surfactant proteins 
(ASP); clones encoding variants of the SP- 
18 and SP-5 forms of human protein are 
disclosed. Methods and vectors for obtain- 
ing these proteins in recombinant form are 
also described. An improved method for 
purification of the 32K protein takes ad- 
vantage of its carbohydrate affinity. 
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Pharmaceutical compositions in the treat- 
ment of respiratory deficiency syndromes 
use the 10K proteins with or without the 
32K form. 
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RECOMBINANT ALVEOLAR SURFACTANT PROTEIN 

Cross-Ref erence to Related Ap plication 
10 This application is a continuation-in-part of 

United States serial no. 008.453. filed 29 June 1987. 
which is a continuation-in-part of United States patent 
application serial no. 857,715. filed 30 April 1986. 
which is a continuation-in-part of United States patent 
15 application serial no. 808.843. filed 13 December 1985 
which is a continuation-in-part of United States patent 
application serial no. 680.358. filed 11 December 1984. 

Technical Field 
20 The invention relates to the field of 

recombinant protein production. More specifically it 
relates to the production of various forms of alveolar 
surfactant protein (ASP) which are useful in the 
management of certain respiratory diseases. 

25 

Background Art 

The human lung is composed of a large number of 
small sacs or alveoli in which gases are exchanged 
between the blood and the air spaces of the lung. In 
30 healthy individuals, this exchange is mediated by the 
presence of a protein containing surfactant complex 
which is synthesized in the microsomal membranes of type 
II alveolar cells. In the absence of adequate levels of 
this complex, a lung cannot properly function — i.e.. the 
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alveoli collapse during exhalation, and cannot be 
subsequently re-inflated by inhaling. Thus, the 
untreated inability to synthesize this complex may 
result in death or in severe physical damage. 



5 



The best documented instance of inadequate 



surfactant complex levels occurs in premature infants 
and infants born after complicated pregnancies, and is 
widely known as respiratory distress syndrome (RDS). A 
widely publicized form of this syndrome has been 

10 designated hyaline membrane disease, or idiopathic RDS. 
RDS is currently the leading cause of infant mortality 
and morbidity in the United States and in other 
developed countries, and substantial efforts have been 
directed to diagnosis and treatment. Current treatment 

15 has focused on mechanical (pressure) ventilation which, 
at best, is an invasive stop-gap measure that often 
results in damage to the lung and other deleterious side 
effects, including complications such as 
bronchopulmonary dysplasia, interstitial emphysema and 

20 pneumothorax. Mental retardation has also resulted on 
occasion when this treatment was used (Hallman. M. , et 
al. Pediatric Clinics of North America (1982) 
29: 1057-1075) . 



25 syndrome by surfactant substitution. This would be a 
method of choice, as. in general, only one 
administration is required, and the potential for damage 
is reduced. For example, Fujiwara, et al. Lancet (1980) 
j.:55-used a protein-depleted surfactant preparation 

30 derived from bovine lungs; the preparation is effective 
but immunogenic. Hallman, M. , et al. Pediatrics (1983) 
7_l:473-482 used a surfactant isolate from human amniotic 
fluid to treat a limited number of infants with some 
success. U.S. Patent 4.312,860 to Clements discloses an 



Limited attempts have been made to treat the 
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thereof ace dramatically more effective than 32K ASP in 
obtaining and maintaining inflation of the lungs and 
that the combination of 10K and 32K proteins is 
synergistic. The invention further relates to DNA 
5 sequences encoding additional mammalian ASP proteins, to 
expression vectors suitable for production of these 
proteins, to recombinant host cells transformed with 
these vectors, and to methods of producing the 
recombinant ASPs and their precursors. In other aspects 
10 the invention relates to pharmaceutical compositions 
containing human ASP and to methods of treating RDS 
using them. 

In still other aspects, the invention relates 
to improved methods to isolate the 32K ASP proteins, and 
15 to purified bovine 10K forms. 

Brief Description of the Drawings 

Figure 1 shows the DNA sequence (along with the- 
deduced amino acid sequence) determined for cDNA 
20 encoding a canine 18 kd ASP protein from overlapping 

cDNA clones, showing the overlapping pD10k-l and pD10k-4 
clones identified. 

Figure 2 shows cDNA sequence and deduced amino 
acid sequence for "cDNA #3" encoding human 18 Jed ASP 
25 protein. 

Figure 3 shows the DNA sequence and deduced 
amino acid sequence of the exon portions of the genomic 
DNA encoding human 18 kd protein. 

Figure 4 shows the sequence of oligonucleotide 
30 probes used to isolate the cDNA encoding human 5 kd/8 kd 
protein. 

Figure 5 shows the DNA and deduced amino acid 
sequence of "cDNA #18" encoding human 5 kd protein. 
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Figure 6 shows an analogous cDNA "#19" encoding 

human 5 led protein. 

Figures 7a and 7b are results of SDS PAGE 

35 

without and with endo F enzyme treatment of S 
5 labeled proteins produced in CHO cells transfected with 
vectors encoding human 18 kd protein. 

Figure 8 shows an SDS gel obtained from 
bacteria transfected with expression vectors for human 
18 kd protein (and controls) labeled with 35 S 
10 methionine. 

Figure 9 shows a Western blot of bacterial 
extracts corresponding to those of Figure 8. 

Figure 10 shows the results of an in. vitro 
determination of the ability of various ASP proteins to 
15 enhance surface tension-lowering by phospholipids. 

Figure- 11 shows the results of an additional in 
vitro determination of the ability of human 18 kd and 5 
kd proteins to enhance surface tension lowering by 
phospholipids . 

20 Figure 12 shows the results corresponding to 

those of Figure 11 for the canine proteins, with and 
without the addition of 32 kd protein. 

Figure 13 shows the nucleotide sequence of a 
canine SP-5 cDNA clone. 

25 Figure 14 shows a comparison of the amino acid 

sequences encoded by two cDNA clones obtained from a 
human lung library in XgtlO. as well as that encoded 
by the genomic clone described as gHS-15 in WO86/03408. 
Also shown are the sequences encoded by two cDNAs 

30 recovered by others. 

Figure 15 shows the nucleotide sequence of a 
synthetic trp promoter used for bacterial expression of 
the surfactant proteins. 
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Modes of Carrying Out the Invention 

A. Def ini t ions 

As used herein, "alveolar surfactant protein 
(ASP)" refers to apoprotein associated with the lung 
surfactant complex and having ASP activity as defined 
hereinbelow. The ASP of all species examined appears to 
comprise one or more components of relatively high 
molecular weight (of the order of 32 Jed) designated 
herein "32K ASP" and one or more quite hydrophobic 
components of relatively low molecular weight (of the 
order of 5-20 kd) designated herein "10K ASP". (King. 
R.J.. et al. J ApdI Physiol (1977) 42:483-491; 
Phizackerley, P.J.R.. Biochem J (1979) 183:731-73 6.) 

The 32K proteins for all species appear to be 
derived from one or more highly homologous prototype 
amino acid sequences in each species. There is evidence 
that "the" 32K protein is encoded by multiple genes with 
minor variations in sequence. Three variants of human 
32K sequence are shown herein. The multiple components, 
found under some conditions, however, of clearly 
differing molecular weights, are due to variations in 
glycosylation patterns. The predecessor application 
hereto. W086/03408. discloses the complete amino acid 
sequence for the human and canine 32K ASP proteins which 
show a high degree of homology. This set of high 
molecular weight, relatively hydrophilic proteins forms 
the subject matter of said predecessor application, and 
the 32K ASP derived from alternate mammalian species is 
expected to exhibit a high degree of homology with the 
canine and human sequences presented. In particular, 
however, two additional variants of the human protein 
are disclosed herein. 



WO 87/06588 



-8- 



PCT/US87/00978 



The low molecular weight "10K" proteins are 
relatively hydrophobic and also appear to be mixtures of 
several proteins of varying molecular weight. Both the 
human and canine proteins exhibit unreduced molecular 
5 weights of 18 kd, 8 kd, and 5 kd . The 8 led and 5 kd 

proteins appear to be identical in N-terminal sequence 
and are presumably derived from the same message but 
contain variations in C-terrainal processing. The 18 kd 
protein, which shows a molecular weight of 10 kd under 

10 reducing conditions, on the other hand, has a clearly 

different amino acid sequence. However, the 18 kd, 8 kd 
and 5 kd proteins of the mammalian species concerned 
herein, all appear to function equivalently in. vivo . 
The invention herein primarily concerns this 10K group. 

15 The predecessor application, W086/03408. disclosed the 

complete cDNA and deduced amino acid sequence for the 18 
kd canine protein, but only a partial DNA sequence for 
. the human counterpart. Only a short N-terminal amino 
acid sequence for the 8 kd/5 kd canine protein was 

20 disclosed; the appropriate cDNA has now been recovered 
for the human protein and the complete sequence of both 
representative 10K proteins made part of the art. 
Because the 10K mixture seems to show products of only 
two DNA sequences, although variations in 

25 pos ttranslational processing can result in multiple 

molecular weights, the designations SP-18 and SP-5 have 
been adopted for these two types of proteins and genes. 

Figure 1 herein corresponds to Figure 2 of 
W086/03408 and shows the complete cDNA sequence for the 

30 mature canine SP-18 protein beginning at leucine shown 
at position 1 and ending at phenylalanine at position 
183. The corresponding sequence for the human SP-18 
protein is shown in Figures 2 and 3, sequences which 
differ only slightly in amino acid sequence as described 
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hereinbelow. The start of the mature protein is the 
phenylalanine residue at position 201 of Figure 2 ending 
with the leucine at position 381. The cDNA thus 
putatively encodes a 181 amino acid protein for the 
5 human. Both the human and dog proteins are. however. 

thought to be processed to shorter sequences by deletion 
of a portion of the carboxy-terminal sequence. For the 
human protein, this is thought to occur so that the 
secreted protein terminates with the arginine shown at 
10 position 286 in Figure 2. Such processing would result 
in a protein of molecular weight about 10K seen in 
reduced electrophoresis gels of isolated mature 
protein. 

The cDNA and deduced amino acid sequences for 
15 two analogous forms of human SP-5 protein are shown in 
Figures 5 and 6. Again, although the cDNA, starting at 
the putative N-terminus of the mature protein encodes 
173 or 174 amino $cids, variations in C-terminal 
processing results in isolated proteins of 5 kd or 8 
20 kd. 

In summary, the 10K group of lower molecular 
weight proteins appears to derive from DNAs encoding two 
different species designated herein SP18 and SP5. The 
SP18 encoded species are so named because they encode a 

25 mature protein which migrates as a protein of 

approximately 18 kd under non-reduced conditions; this 
protein is apparently a dimer of a smaller approximately 
10 kd monomer units. The monomer units are formed by 
pos t-translational processing involving cleavage at the 

30 carboxy terminus of the encoded protein, which, as is 
further explained below, would otherwise comprise 181 
amino acids. Similarly, SP5 encodes a protein of 
putative molecular weight of approximately 19 kd. 
However, again, this molecular weight protein is not 
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found in extracts, and the encoded amino acid sequence 
is evidently processed to the 5 led and 8 kd proteins 
obtained. 

The recombinant ASP proteins of the invention 
have amino acid sequences corresponding to those 
illustrated herein. It is understood that limited 
modifications may. however, be made without destroying 
activity. For example. Figure 14 shows five variants of 
the 32K protein. Further, only a portion of the entire 
primary structure may be required. For example, the 
human SP18 recombinant protein of the invention has an 
amino acid sequence substantially similar to that shown 
in Figure 2. but minor modifications of this sequence 
which do not destroy activity also fall within the 
definition of SP18 human ASP and within definition of 
the protein claimed as such, as further set forth 
below. Also included within the definition are 
fragments of the entire sequence of Figure 2 which 
retain activity particularly those which result from 
post-translational processing. 

As is the case for all proteins, the ASP 
proteins can occur in neutral form or in the form of 
basic or acid addition salts depending on its mode of 
preparation, or. if in solution, upon its environment. 
It is well understood that proteins in general, and. 
therefore, any ASP. in particular, may be found in the 
form of its acid addition salts involving the free amino 
groups, or basic salts formed with free carboxyls. 
Pharmaceutically acceptable salts may. indeed, enhance 
the functionality of the protein. Suitable 
pharmaceutically acceptable acid addition salts include 
those formed from inorganic acids such as. for example, 
hydrochloric or sulfuric acids, or from organic acids 
such as acetic or glycolic acid. Pharmaceutically 
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acceptable bases include the alkali hyroxides such as 
potassium or sodium hydroxides, or such organic bases as 
piperidine. glucosamine, trimethylaraine, choline, or 
caffeine. In addition, the protein may be modified by 
combination with other biological materials such as 
lipids and saccharides, or by side chain modification, 
such as acetylation of amino groups, phosphorylation of 
hydroxyl side chains, or oxidation of sulfhydryl groups 
or other modification of the encoded primary sequence. 
Indeed, in its native form. ASP proteins are 
glycosylated, and certain of the encoded proline 
residues have been converted to hydroxyproline . The 
proteins are also found in association with the 
phospholipids in particular DPPC and PG. Included 
within the definition of any ASP protein form herein are 
glycosylated and unglycosylated forms, hydroxylated and 
non-hydroxylated forms, the apoprotein alone, or in 
association with lipids, and, in short, any composition 
of an amino acid sequence substantially similar to that 
of the native sequences which retains its ability to 
facilitate the exchange of gases between the blood and 
lung air spaces and to permit re-inflation of the 
alveoli . 

It is further understood that minor 
modifications of primary amino acid sequence may result 
in proteins which have substantially equivalent or 
enhanced activity as compared to any particular 
illustrated sequence. These modifications may be 
deliberate, as through site-directed mutagenesis, or may 
be accidental, such as through mutation of hosts which 
are ASP producing organisms. All of these modifications 
are included as long as the ASP activity is retained. 

"ASP activity" for a protein is defined as the 
ability, when combined with lipids either alone or in 
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combination with other proteins, to exhibit activity in 
the in. vivo assay of Robertson, B. Lung (1980) 
158 : 57-68 , and described hereinbelow. In this assay, 
the sample to be assessed is administered through an 
5 endotracheal tube to fetal rabbits or lambs delivered 
prematurely by Caesarian section. (These "preemies" 
lack their own ASP, and are supported on a ventilator.) 
Measurements of lung compliance, blood gases and 
ventilator pressure provide indices of activity. 

10 Preliminary assessment of activity may also be made by 
an in vitro assay, for example that of King. R. J., et 
al. Am J Physiol (1972) 223 : 715-726 . or that illustrated 
below of Hawgood, et al, which utilizes a 
straightforward measurement of surface tension at a 

15 air-water interface when the protein is mixed with a 

phospholipid vesicle preparation. The 10K and 32K ASP 
proteins described herein show ASP activity in 
combination as well as independently. Although it had 
previously been believed that the 10K protein displayed 

20 ASP activity only when acting in concert with the 32K 

family, the inventors herein have now demonstrated that 
the 10K protein alone displays significant ASP activity 
and that supplementation with the 32K protein acts 
synergistically to enhance activity of the 10K 

25 protein(s ) . 

"Operably linked" refers to a juxtaposition 
wherein the components are configured so as to perform 
their usual function. Thus, control sequences operably 
linked to coding sequences are capable of effecting the 

30 expression of the coding sequence. 

"Control sequence" refers to a DNA sequence or 
sequences which are capable, when properly ligated to a 
desired coding sequence, of effecting its expression in 
hosts compatible with such sequences. Such control 
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sequences include promoters in both procaryotic and 
eucaryotic hosts, and in procaryotic organisms also 
include ribosome binding site sequences, and, in 
eucaryotes, termination signals. Additional factors 
5 necessary or helpful in effecting expression may 

subsequently be identified. As used herein, "control 
sequences" simply refers to whatever DNA sequence may be 
required to effect expression in the particular host 
used. 

10 "Cells" or "recombinant host cells" or "host 

cells" are often used interchangeably as will be clear 
from the context. These terms include the immediate 
subject cell, and, of course, the progeny thereof. It 
is understood that not all progeny are exactly identical 

15 to the parental cell, due to chance mutations or 

differences in environment. However, such altered 
progeny are included when the above terms are used. 

B . General Description 
20 The methods illustrated below to obtain DNA 

sequences encoding ASP are merely for purposes of 
illustration and are typical of those that might be 
used. However, other procedures may also be employed, 
as is understood in the art. 

25 

B . 1 . The Nature of the Surfactant Complex 
The alveolar surface of lung has been studied 
extensively by a number of techniques, and by a number 
of groups. It appears that the membrane of the alveolus 
30 is composed of type I and type II alveolar cells, of 

which the type II cells comprise approximately 3% of the 
surface. The type II cells are responsible for the 
exocrine secretion of materials into a lining fluid 
layer covering the basement membrane, which materials 
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decease the surface Censlon 

th . hl T comprised of „ at er derived £rom 

the blood plasma of the alveolar capiUaties and h 
surfactant secretions of the t ype !i 0 ."."'' ^ ^ 

pg of or T e ^ " CellS ' th — "v. contain 60-100 
PS of protein and about 1 Pg of lipid phospnorus ° 

cell where the ratio between type II cell DPPC an iPG 

Phosphorus ls about 8 to 1. studies of the apoprotein 

=".".:■:::••: -~z 

Phospholipid component (King. r. j. et a ° „ 

Respir n<« (1974) 110 .,,,, „ ' 6C a1 ' &s -^ 

i »/«} 110.273) or are not (Shelly, s. a et 
J Lipid Bps, (1975) 16:224). 

weight D /r h " Sh ° Wn hi9h " -or.cul« 

weight Protein obtained by pulmonary lavage 0 f doo, 

separated by gel electrophoresis is composed of "^oT 
components of molecular weight 29.000. 32.000 and 

000 daltons. Xhe 32.000 dalton protein was used to 
obtain sequence data, as ser ^n-v, u , 

n * ^ forth below; however an 

i t:^ have identicai 3 

clycosyLtion n 6nCe in »f 

with eLL, . Dl9eSti ° n ° f thS 36 " •«* 32 M bands 

ThaL I y , C ° S a " F ' WhiCh rem ° VeS carbohydrate side 

30 ;r :;:::; its T ;; ::; d : cts ^ — «*. - 
-«.«./:; th^: ^ -j: : d so o r nen :: is 

- the 32 M faction aggregates into dLer e s en a„d Wn 
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Th e smaller molecular weight proteins are 

difficulty, but these, coo. appear 
extracted with more difficulty. descri ption 
to be mixtures (PhizacKerley et al . Mv . 
b elow>. - both the do, ^ >™» «. «- 

be :: ::::::: fo^rr^.: studied . t - 

with respect protein mixtures 

i thA lower molecular weight protein 
level, the lower ^ sequence . 

appear to contain two types ^ sequences 



10 



15 



20 



ae signated herein SP-18 and 6P-S. ul „ weight 

are encoded by cDNA corresponding t oximately 
primary seguence .« approximate ly^d. ^ ^ ^ 

180 amino acids. Howev nroteins . The SP-5 DNA 

in vivo to shorter proteins, 
processed in ^ approximately 173 amino 

.... >. r 
-»".Tr - rr.:"~.: .•»»- - 

C-terminus o£ the proteins produced. 

B2 . a o J1 i a a_of_Coai^^ 
■.„„ B .. an RSP Proteins 
The en^rTlal^nTh^aT^P 32K |» « 
coding seguences have been cloned and expr essed^s 

several of the low obtained and 
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On h i»<or..tioa .. t £octh ln Figure , herein 

One of these clones, which contained canine ASP encoding 

bacteriophage Xgtlo from idRNA isolated fro. adult 
- human !ung to obtain a human SP-16; which was. in turn 

used to probe a human genomic library. The complete ' 
. seguenceCs, eor human sp _ ie enco<jed ^ 

the genom.c clone are disclosed. Probes designed 
corresponding to the N-terminal amino acid sequence of a 
5 kd canine protein were then used to obtain SP-5 cDNA 
from the Xgtlo lung library. Variants of this 
sequence are also disclosed. 



B.3. Expression of asp 

As the nucleotide sequences encoding the 
additional human and canine ASP proteins are now 
available, these may be expressed in a variety of 
systems. lf procaryotic systems are used, an intronless 
coding sequence should be used, along with suitable 
control sequences. The cDNA clones for any of the above 
ASP proteins may be excised with suitable restriction 
enzymes and ligated into procaryotic vectors for such 
expression. ror procaryotic expression of ASP genomic 
™». the DNA shou ld be modified to remove the introns. 
either by site-directed mutagenesis, or by retrieving 
corresponding portions of cDNA and substituting them for 
the intron-containing genomic sequences. The intronless 
coding DNA is then ligated into expression vectors for 
procaryotic expression. Several illustrative expression 
systems are set forth below. 

As exemplified below. ASP encoding sequences 
may also be used directly in an expression system 
capable of processing the introns. usually a mammalian 
host cell culture. To effect such expression, the 



WO 87/06588 PCT/US87/00978 



genomic sequences can be ligated downstream from a 
controllable mammalian promoter which regulates the 
expression of these sequences in suitable mammalian 
cells. 

5 In addition to recombinant production, proteins 

of the invention of sufficiently short length, such as 
the 5 kd protein, may be prepared by protein synthesis 
methods . 

10 B.4. Protein Recovery 

The ASP protein may be produced either as a 
mature protein or a fusion protein, or may be produced 
along with a signal sequence in cells capable of 
processing this sequence for secretion. It is 

15 advantageous to obtain secretion of the protein, as this 
minimizes the difficulties in purification; thus it is 
preferred to express the human ASP gene which includes 
the codons for native signal sequence in cells capable 
of appropriate processing. It has been shown that 

20 cultured mammalian cells are able to cleave and process 
heterologous mammalian proteins containing signal 
sequences, and to secrete them into the medium 
(Mccormick, F., et al. Mol Cell Biol (1984) 4_:166). 

When secreted into the medium, the ASP protein 

25 is recovered using standard protein purification 

techniques. The purification process is simplified, 
because relatively few proteins are secreted into the 
medium, and the majority of the secreted protein will, 
therefore, already be ASP. However, while the 

30 procedures are more laborious, it is within the means 
known in the art to purify this protein from sonicates 
or lysates of cells in which it is produced 
intracel lular ly in fused or mature form. 
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B . 5 . Improved Method for 32K ASP Purification 
Disclosed herein is a particularly advantageous 

process for the purification of the 32K proteins 

produced either natively or recombinant ly which takes 
5 advantage of the similarity of certain domains of the 

primary sequence to the carbohydrate binding moieties of 

lectins . 

Accordingly, one aspect of the invention herein 
is a process for purification of the 32K ASP proteins 

10 which comprises subjecting ,a mixture containing such 

proteins to affinity chromatography in which the moiety 
responsible for the affinity is a carbohydrate, 
especially mannose or a carbohydrate-bound protein. As 
illustrated below, e.g. mannose itself directly coupled 

15 to a suitable support such as agarose or Sepharose or 
other commonly used chromatographic solid support, or 
glycoproteins containing high levels of mannose may be 
employed. While mannose is most preferred, other 
functional affinity partner carbohydrates include fucose 

20 and N-acetyl glucosamine. The variation of design in 

chromatographic support for a particular affinity group 
is well understood by practitioners of the art, and any 
configuration which provides the carbohydrate as the 
available adsorbent is suitable. 

25 The binding advantageously takes place in the 

presence of low concentrations of calcium ion, and 
elution is advantageously conducted by removal of 
calcium ion using, for example, EDTA. However, elution 
may also be effected by a substance in the elution 

30 solvent which competes with the affinity column for 
binding to ASP, such as increasing concentrations of 
mannose or galactose. Elution can also be performed by 
supplying reducing agents, as reduction of disulfide 
bonds releases the binding, as do high and low pH. 
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While low pH may cause denaturation. elution in borate 
buffer at about pH 10 is effective. 

B. 6 . Assay for ASP Activity 
5 In vitro methods have been devised to assess 

the ability of ASP proteins to function by reducing 
surface tension (synonymous with increasing surface 
pressure) to generate a film on an aqueous/air 
interface. Studies using these methods have been 
10 performed on the isolated native 32K canine ASP. 

(Benson. B.J.. et al Proa Resp Res (1984) 18:83-92; 
Hagwood, S.. et al, Biochemistry (1985) 24:184-190.) 

Tanaka. Y. et al. Chem Pharm Bull (1983) 
3_1:4100-4109 disclose that a 35 kd protein obtained from 
15 bovine lung enhanced the surface spreading of DPPC; 

Suzuki. Y . , J Lipid Res (1982) 23_:62-69: Suzuki. Y. . et 
al. Proa Resp Res (1984) 18.: 93-100 showed that a 15 kd 
.protein from .pig lung enhanced the surface spreading of 
the lipid-protein complex from the same source. 
20 Since the function of the surfactant complex In 

vivo is to create a film at the air/aqueous interface in 
order to reduce surface tension, the ability of ASP 
proteins to enhance the formation of the film created by 
the spread of lipid or lipoprotein at such a surface in 
25 an in vitro model is clearly relevant to its utility. 

An In vivo model, described in the examples, 
may also be employed. 

B.7. Administration and Use 
30 The purified proteins can be used alone and in 

combination in pharmaceutical compositions appropriate 
for administration for the treatment of respiratory 
distress syndrome in infants or adults. The 
compositions and protein products of the invention are 
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also useful in treating related respiratory diseases 
such as pneumonia and bronchitis. The complex contains 
about 50% to almost 100% (wt/wt) lipid and 50% to less 
than 1% ASP; preferably ASP is 5%-20% of the complex. 
5 The lipid portion is preferably 80%-90% (wt/wt) DPPC 
with the remainder unsaturated phosphatidyl choline, 
phosphatidyl glycerol, tr iacylglycerols , palmitic acid 
or mixtures thereof. The complex is reassembled by 
mixing a solution of ASP with a suspension of lipid 

10 liposomes, or by mixing the lipid protein solutions 
directly in the presence of detergent or an organic 
solvent. The detergent or solvent may then be removed 
by dialysis. 

While it is possible to utilize the natural 

15 lipid component from lung lavage in reconstructing the 
complex, and to supplement it with appropriate amounts 
of ASP proteins, the use of synthetic lipids is clearly 
preferred. First, there is the matter of adequate 
supply, which is self-evident. Second, purity of 

20 preparation and freedom from contamination by foreign 
proteins, including infectious proteins, which may 
reside in the lungs from which the natural lipids are 
isolated, are assured only in the synthetic 
preparations. Of course, recons t itution of an effective 

25 complex is more difficult when synthetic components are 
used . 

As noted above, it had been previously been 
believed that the 10K ASP mixture served primarily to 
enhance the activity of the 32K mixture; however, it has 
30 now been established by the inventors herein that a 

preferred composition comprises either a complex with 
the 10K protein alone, the SP-5 or SP-18 protein alone, 
a complex of the 10K and 32K mixtures, or a complex of 
an SP-18 or SP-5 protein and the 32K mixture. In the 
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latter two cases, a preferred protein ratio -- i.e., 
32K:10K or 32K:SP-18 or 32K:SP-5 — is typically in the 
range of 3:1 to 200:1, preferably about 10:1 to 5:1. 
The 32K protein may be added directly to an aqueous 
5 suspension of phospholipid vesicles in an aqueous 

solution. Because it is so hydrophobic, the 10K mixture 
(or the SP-5 or the SP-18 proteins) is added to the 
lipids in an organic solvent, such as chloroform, the 
solvents evaporated, and the vesicles re-formed by 

10 hydration. 

The addition of the 32K protein to the 10K type 
for the administration of the surfactant complex appears 
to have a synergistic effect — i.e.. the combination of 
32K and 10K type proteins exerts the desired activity at 

15 protein concentrations lower than those required for the 
10K protein alone. Accordingly, in a preferred method 
of the invention, the surfactant complex administered 
will contain an effective amount of the 10K mixture, pc 
of the individual SP-5 or SP-18 proteins in admixture 

20 with the 32K ASP. Particularly preferred compositions 
contain the ratios of 32K:10K type protein as set forth 
above, along with a suitable amount of lipid component, 
typically in the range of 50 - almost 100% of the 
composition. 

25 The compositions containing the complex are 

preferably those suitable for endotracheal 
administration, i.e.. generally as a liquid suspension, 
as a dry powder "dust" or as an aerosol. For direct 
endotracheal administration, the complex is suspended in 

30 a liquid with suitable excipients such as, for example, 
water, saline, dextrose, or glycerol and the like. The 
compositions may also contain small amounts of nontoxic 
auxiliary substances such as pH buffering agents, for 
example, sodium acetate or phosphate. To prepare the 
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"dust", the complex, optionally admixed as above is 
lyophilized. and recovered as a dry powder. 

If to be used in aerosol administration, the 
complex is supplied in finely divided form along with an 
additional surfactant and propellent. Typical 
surfactants which may be administered are fatty acids 
and esters, however, it is preferred, in the present 
case, to utilize the other components of the surfactant 
complex. DPPC and PG. Useful propellents are typically 
gases at ambient conditions, and are condensed under 
pressure. Lower alkanes and fluorinated alkanes. such 
as Freon. may be used. The aerosol is packaged in a 
container equipped with a suitable valve so that the 
ingredients may be maintained under pressure until 
released. 

The surfactant complex is administered as 
appropriate to the dosage form, by endotracheal tube by 
aerosol administration, or by nebulization of the 
suspension or dust into the inspired gas". Amounts of 
complex between about o.l m, and 200 mg. preferably 
SO-60 m,/kg body „ eight _ are adrainistered in Qne dQse 
For use in newly born infants, one administration is 
generally sufficient. For adults, sufficient reconsti- 
tuted complex is administered to replace demonstrated 
levels of deficiency (Hallman. M. . et al. J ciini^i 
Investigation (1982) 70:673-682). ~~ 

c - Standard Merhn^c 

Most of the techniques which are used to 

transform cells, construct vectors, extract messenger 
ENA. prepare cDNA libraries, and the like are widely 
practiced in the art. and most practitioners are 
familiar with the standard resource materials which 
describe specific conditions and procedures. These 
methods are set forth with particularity in W086/03408 
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As set forth in this predecessor application, 
expression may be achieved in a variety of host systems 
including, in particular, mammalian and bacterial 
systems, as well as yeast based systems. In addition. 
5 other cell systems have become available in the art. 

such as the baculovirus vectors used to express protein 
encoding genes in insect cells. The expression systems 
set forth below are illustrative, and it is understood 
by those in the art that a variety of expression systems 
10 can be used. 

D . Examples 

D . l. Isolation of Mamm alian ASP Proteins 
15 Canine, human and bovine ASP proteins were 

obtained in purified form. 

D.l.a. isolation of the Canine Surfactant 

Complex 

Lung surfactant complex was prepared from 
20 canine lungs obtained from exsanguinated canines. All 
procedures, including the lavage, were performed at 4°C 
and the isolated material was stored at -15°C. 

The lungs were degassed and lavaged 3 times 
with one liter per lavage of 5 mM Tris-HCl. 100 mM NaCl. 
25 pH 7.4 buffer. The Ca +2 concentration of this buffer 
was less than 5 x lO -6 M (Radiometer F2112 Ca ; 
Radiometer A/S. Copenhagen. Denmark). The pooled lung 
washings were spun at 150 x g av for 15 min (Sorval 
RC2-B ) to remove cellular material. The supernatant was 
30 then spun at 20.000 x g av for 15 hr (Beckman L3-40) 
using a type 15 rotor (Beckman Instruments), and the 
resulting pellet was dispersed in buffer containing 1.64 
M sodium bromide. After equilibration for 1 hr. the 
suspension was spun at 100.000 x g av for 4 hr (Beckman 
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L5-50B) in a SW28 rotor (Beckman Instruments). The 
pellicle was resuspended in buffer and spun at 100,000 x 
g av for 1 hr (Beckman L5-50B) . This pellet containing 
the complex was resuspended in double distilled water. 
5 Pellet resuspended in water at a concentration 

of 10-15 mg phospholipid/ml was injected into a 50-fold 
volume excess of n-butanol (Sigrist. H. , et al, Biochem 
Biophys Res Commun (1977) 74. : 1 7 8-184) and was st irred at 
room temperature for 1 hr . After centr if uga t ion at 

10 10,000 x g av for 20 min (Sorval RC2-B ) , the pellet, 
which contains the 32K ASP is recovered for further 
purification as described below. The supernatant, which 
is a single phase, contains the lipids and the lower 
molecular weight proteins. To obtain the lipids, the 

15 supernatant was dried under vacuum at 40°C and the 

lipids were extracted (Folch, J., et al. J Biol Chem 
(1957) 226 :497-509) . 

To obtain the hydrophobic protein, the 
supernatant Was subjected to Rotovap to remove the 

20 butanol, and further dried by addition of ethanol 

followed by Rotovap. The dried residue was suspended in 
redistilled chloroform containing 0.1 N HC1, and 
insoluble material removed by centr if ugation. 

The resulting solution was chromatographed over 

25 an LH-20 column (Pharmacia) and developed in 

chloroform. (LH-20 is the hydroxypr opyl derivative of 
Sephadex G-50; it is a hydrophobic gel which is inert to 
organic solvents.) The proteins are excluded; 
lipids/phospholipids elute from the included volume. 

30 Protein was recovered from the void volume 

fractions by evaporation of the chloroform under 
nitrogen, and then subjected to sizing on polyacrylamide 
gels. When run under non-reducing conditions, bands of 
approximately 18 kd (identified in W086/03408 as 16.5 
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kd). 8 kd (identified in W086/03408 as 12 kd). and 5 kd 
(identified in W086/03408 as 6 kd) were obtained; under 
reducing conditions, a single broad band of 5-12 kd was 
found . 

The 18 kd. 8 kd. and 5 kd bands from the 
non-reduced gels were subjected to N-terminal analysis 
by Edman degradation, to give the following sequences: 

For 18 kd: ?-Pro-lle-Pro-Leu-Pro-Tyr-Cys-Trp-Leu-Cys- 

Arg-Thr-Leu-lle-Lys-Arg-Ile-Gln-Ala-Met-Ile- 
Pro-Lys-Gly-Val-Leu-Ala-Val-Thr- ? -Gly-Gln- 

For 8 kd: I le-Pro-Cys-Phe-Pro-Ser-Ser-Leu-Lys-Arg-Leu- 
Leu-Ile-Ile-Val-Trp- 

For 5 kd: i le-Pro-Cys-Phe-Pro-Ser-Ser-Leu-Lys-Ar g-Leu- 
Leu-Ile-Ile-Val-Trp- 

The 5-12 kd band also represents a mixture of the 18 kd. 
8 kd and 5 kd proteins, designated herein as the "lOK" 
mixture of proteins. 

The precipitate from the n-butanol extraction 
above was used to obtain the purified 32K apoprotein as 
described in W086/03408 (supra). 

D - 1 - b - Isolati on of Human ASP 

Human 32K and lower molecular weight ASP was ^ 
prepared following the procedure described in the 
published W086/03408. 

The isolated low molecular weight hydrophobic 
proteins show bands corresponding to 18 kd. 8 kd and 5 
kd when subjected to polyacrylamide gel electrophoresis 
under non-reducing conditions. Under reducing 
conditions, a single broad band corresponding to 5-12 kd 
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is obtained. The molecular weights of these bands are 
slightly different from those reported in the published 
application. 

D.l.c. Isolation of Bovine ASP 
The 10K bovine ASP containing 5 kd and 18 kd 
proteins was isolated from the lavage fluid of bovine 
lungs, in a method similar to that used for canine ASP. 

Excised bovine lungs were filled with 
Tris-buf f ered saline, and the fluid removed from the 
lungs by vacuum. The lavage was centrifuged at 200 xg 
for 10 minutes and the supernatant recovered and 
centrifuged at 8-9000 xg for 20 minutes. The 
(surfactant) pellet was then suspended in 0.8M sucrose, 
which has a density greater than the buoyant density of 
the surfactant, and centrifuged at about 100.000 xg for 
three hours. The floating surfactant was then suspended 
in water and sedimented at about 9'-10,000 xg for 20 
minutes to remove the sucrose. 

The phospholipid-rich surfactant was first 
extracted with 98% n-butanol. into which up to 2% 
aqueous surfactant (by volume) was added. This 
one-phase extraction allows solubilization of the 5 kd 
and 18 kd proteins and lipids while causing 
precipitation of the other proteins, which were removed 
by centrifuging at 9-10.000 xg. The butanol solution 
was then chromatographed over an LH-20 gel permeation 
column (Pharmacia) to separate the lipids from the 5 kd 
and 18 kd proteins. The desired protein peak was then 
rechromatographed over LH-60 which separates the 18 kd 
from the 5 kd protein. Both columns are run using 
chloroform: methanol (2:1. v:v) containing 0.5% 0.1N HC1 . 

The purified 5 kd and/or 18 kd proteins, either 
alone or in combination (1:1). were mixed in various 
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weight ratios with synthetic phospholipids to obtain an 
effective surfactant. 

D. 2 . cDNA Encoding Canine 10K AS P Proteins 
5 Messenger RNA extracted from adult canine lung 

tissue was used to prepare a DNA library using GC 
tailing in pBR322 as described in W086/03408 (supra). 

The SP-18 Protein : Two oligomeric probes were 
synthesized corresponding to the N-terminal sequence of 

10 the 18 kd protein using mammalian codon preference 

tables for codon choice. Probe 1198 was a 36-mer of the 
sequence 5 ' -GGTCACAGCCAGGCCCTTGGGGATCATGGCCTGGAT-3 • ; 
probe 1199 was a 45-mer of the sequence 
5 1 -CTTGATCAGGGTTCTGCACAGCCAGCAGTAGGGCAGGGGGATGGG- 3 ' . 

15 Both were labelled with 32 P by kinasing. 

For hybridization, filters were baked at 80°C 
for two hours under vacuum and then washed for 4 hr at 
68°C with shaking in a large volume of 3 x SSC 
containing 0.1% SDS. The filters were prehybr idized for 

20 several hours at 42°C in 6 x SSC. 5 x Denhardt's. 20% 
formamide, 0.1% SDS. and 100 ug/ml sheared, denatured 
salmon sperm DNA. Duplicate filters were hybridized in 
the above buffer containing either 13 ng/ml probe 1198 
or 16 ng/ml probe 1199 at an initial temperature of 

25 68°C. and then at 42°C overnight. The filters were 

washed twice for 15 min at room temperature in 6 x SSC. 
0.1% SDS, 0.05% sodium pyrophosphate, then for 5 rain at 
65°C in the same buffer, and then dried and 
autoradiographed . 

30 Of 40.000 clones screened, 8 hybridized to both 

probes, and were subjected to restriction analysis. Two 
overlapping clones which when combined span 1520 
nucleotides were sequenced, with the results shown in 
Figure 1. These two clones are designated pD10k-l and 
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pD10k-4, and are identified in Figure 1. The arrow 
indicates the beginning of the mature 18 Jed protein. 

cDNA encoding the SP-5 proteins : An oligomeric 
probe was synthesized which corresponded to the putative 
5 sequence of human 5 kd lung surfactant protein. A dog 

lung cDNA library was constructed as described above and 
screened. The cDNA isolated was approximately 800 bp. 
This was not a full-length cDNA. as Northern analysis 
showed that the full-length clone should be about 1.1 
10 kb. The cDNA clone started approximately 30 amino acid 
residues upstream of the N-terminus of the mature dog 5 
or 8 kd protein. A possible clip site (Gln-Gln) which 
would give a protein of approximately 5 kd. 

15 D.3. Human ASP DNAs 

A human genomic library cloned into bacterio- 
phage Charon 28 (Rirara. D. L.. et al # Gene (1980) 
12.:301-3io) was obtained from Dr. T. Maniatis, Harvard 
University. Approximately 1.5 x 10 6 phage were grown 

20 on E . coli K803, and plaque lysates were transferred to 
nitrocellulose filters as described by Benton. W. D. , et 
al. Science (1977) 196 :180-182. Isolation of the 
genomic clone gHS-15 which encodes the 32 kd human 
protein and expression of this gene have already been 

25 described. 

In addition. cDNA libraries from human lung 
were prepared as described previously either by GC 
tailing or in \gtlO. The recovery of cDNA encoding 
the 32 kd human ASP protein was also described in • 

30 W086/03408. 

Disclosed herein, in Figure 14, are amino acid 
sequences encoded by cDNA clones obtained herein from 
the human lung library in XgtlO and designated pHS10-5 
and pHS-10-4. These proteins differ by one and seven 
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amino acids, respectively, from the protein encoded by 
the recovered genomic clone described in W086/03408 
which protein sequence is also shown in Figure 14. The 
remaining- sequences of Figure 14, labeled 6A and 1A. are 
5 additional variants encoded by cDNAs obtained by 

others. it is believed that the 32K human ASP protein 
may be encoded by multiple genes. 

Recovery of sp-ift : As described in the 
published application, the cDNA library in Xgtio was 
10 screened on nitrocellulose filters using lxl0 6 C pm of 

the canine clone pD10k-l described above (and identified 
m Figure 1) in 40% formamide. 5 x SSC, 0.05% SDS 5 x 
Denhardfs. 50 ug/ml yeast tRNA and 50 ug/ml salmon 
sperm DNA for 16 hr at 37°C. (The pD10k-4 segment or 
the full-length combination of the pD10Jc-l and pD10k-4 
clones can be used as well.) The filters were washed 
twice at 50-c for 30 min in 2 x SSC. 0.1% SDS. dried and 
autoradiographed. of 40.000 plaques, two were positive 
and one. designated cDNA #3 containing a 1.5 kb insert 
was chosen for sequencing. The complete nucleotide and 
deduced amino acid sequence for the SP18 protein and its 
precursor are shown in Figure 2. The mature SP18 
protein begins, as shown in the Figure, at nucleotide 
614 with the Phe at 201. it is believed that the 
carboxy terminus of the processed protein is the 
arginine at position 286. The 1.5 kb insert was excised 
and subcloned into EcoRl-cut pUC8; this plasmid 
designated as phl8K-3. was deposited in E. coli K-12 
strain MC1061 with the American Type Culture Collection 
under ATCC accession no. 67276. 
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The P hl8K-3 cDNA insert was used to screen the 
human genomic library (supra) for the gene encoding the 
SP18 protein and its precursors. The sequences of the 
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differences: Asn-138 in W 18 is Thr-138 in tt 19. and 
Asn-186 in # 18 is Ser-186 in tt 19. 

There are two N-terminal amino acid residues 
seen in the human 5 kd and 8 kd proteins, corresponding 
5 to Phe-24 and Gly-25 in Figures 5 and 6. The carboxy 
termini of the 5 kd and 8 kd proteins have not been 
precisely determined; it is postulated that the 8 kd 
protein ends at Gln-108. while the 5 kd protein ends at 
Glu-80 or at Thr-65. 

10 (A canine lung library in pBR322 was prepared 

substantially as described above and screened with the 
human 820 bp clone. The isolated cDNA — designated 
pD6k-ll — was about 800 bp (see Figure 13), not a 
full-length cDNA. The clone started approximately 30 

15 amino acid residues upstream of the N-terminus of the 
mature canine SP-5 protein, and contained a possible 
Gln-Gln clip site.) 

D. 4 . Construction of Mammalian Expression 

20 Vectors 

Vectors suitable for expression of the various 
ASP encoding sequences in mammalian cells, which are 
also capable of processing intron-containing DNA were 
constructed. Expression is controlled by the 

25 metallothionein II (hMTII) control sequences, as 
described by Karin. M. . et al. Nature (1982) 
299 :797-802 . 

An intermediate host vector. pMT was obtained 
by ligating the promoter into pUC8 as follows: 

30 Plasmid 84H (Karin. M. . et al (supra)) which 

carries the hMTII gene was digested to completion with 
BamHI. treated with exonuclease Bal-31 to remove 
terminal nucleotides, and then digested with Hindlll to 
liberate an 840 bp fragment containing nucleotides -765 
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to .70 of the hMTI I gene (nucleotide .1 lm the £irst 
nuc eotide transcribed). The 840 bp 

7' li9 " ed With H1 -»"Hi-c» Rested pucs 
5 u' ti •" ' U982) and the 

ligation mixture transform into E. coli „cio61 The 
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mature form, with an additional Gly residue at the amino 
terminus, and with the putative car boxy- terminal Arg-286 
changed to an Asn residue. 

in another construct. Phe-201 and Ser-287 are 
changed to Asp residues. Cleavage with acid (between 
Asp and Pro) yields a mature form of the SP-18 protein 
missing the N-terrainal Phe-201. and with an additional 
carboxy-terrainal Asp residue. 

An additional construct allows in vitro 
processing of the precursor with a more gentle, 
enzymatic procedure, employing Staph V8 peptidase, which 
cleaves after Glu residues. Advantage is taken of 
natural Glu residues at Glu-198 and Glu-291 by 
converting the Glu-251 to Asp. The 43 kd precursor is 
cleaved with Staph V8 to yield the putative mature SP-18 
protein with an additional Gln-Gln at the amino 
terminus, and Pro-Thr-Gly-Glu at the carboxy terminus, 
in an additional construct. Glu residues can be placed 
in positions 200 and/or 287. 

D. 7 . Expression in Bacteria 

The unglycosylated form of the SP-18 protein 
can be produced in bacteria as a 181 amino acid 
precursor representing met-preceded residues 201-381 or 
as a hydroxylamine-cleavable fusion protein precursor 
with a 15 residue B-galactosidase leader. A modified 
cDNA encoding amino acids 201-381 of the cDNA. preceded 
by ATG is inserted into the Trp controlled vector. 
pTrp-233 (pTrp host vector) between the EcoRI site and 
the HindHI site to give pTrp-20. This construct 
produces a protein of M.W. 20 kd . An analogous 
construct in pBGal host vector. pBGal-20 contains the 
same sequences of SP18 cDNA # 3 fused to a 15 residue 
fl-galactosidase leader through a hydroxylamine-sens it ive 
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for expression in bacteria were also prepared as 
follows. In pTrp-20. codons encoding Arg-286 Ser-287 
were altered to encode Asn-Gly; introducing the 
hydroxylamine-sensitive cleavage site, or the codon for 
5 Ser-287 was replaced by a codon for Asp, resulting in 
the acid-sensitive Asp-Pro cleavage site; or the codon 
for Glu-251 was replaced with a codon for Asp, allowing 
cleavage with Staph V8 at Glu-291 without cleaving the 
desired protein. Also, in both pTrp-20 and pBGal-20, 

10 the sequences 3 1 to the putative carboxy terminal 
Arg-286 were deleted and replaced by a stop codon. 
Neither construct resulted in labeled protein of proper 
size after induction. 

Analogous to pTrp-20, the desired fragment of 

15 the cDNA # 18 (Figure 5) extending from Gly-25 preceded 
by ATG to the carboxy-terminal Ile-197 of the SP-5 
"precursor" was inserted into EcoRI/Hindl I I digested 
pTrp-233 to give pTrp-5 and into pBGal host vector to 
give pBGal-5 wherein the SP-5 sequence is fused to a 

20 fl-galac tos idase leader through a hydroxylamine-sensitive 
Asn-Gly. 

Also, cleavage with Staph V8 of the protein 
expected from this construct at the Glu preceding Phe-24 
and at Glu-66 yields mature 5 Jed protein if the putative 
25 C-terminus is correct. 

These constructs are transformed into E. coli 
W3110 and expressed as described above. 

D. 8 . Purification of the 32K Proteins 
30 The 32K proteins have a striking amino acid 

homology with circulating mannose-binding proteins, and 
also contain residues common to the carbohydrate-binding 
domains of other lectins. It is believed that 
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6 " " flSP " -11 « the other 3 2K pro e Ls 

in the relation of surfactant metabolism or in ot '«" 

exploit the mannose affinity of the proteins so as to 

T . II USi " 9 C " b ° h ^"" «»i-l«>r chromatography 

The chromatographic purification may be carried ou t 
either on an immobilized glycoprotein containing a high 
Proportion of mannose residues (e.g yeast n * 9 
inw»n-ao« % v = .y.. yeast mannan or 

" taSe) or on colu »>^ constructed directly with 
mannose coupled to agarose. 

The 36 kd protein isolated from lung lavage was 
found to bind to Mobilized monosaccharides with a 
broad specificity in the presence of 1 m C a 2+ a 

e P l r Dd £iCati0n PCOOedU " "cordin, to this preferred 
embodiment was carried out as follow. Cell culture 
«-x. (typically e-l« liters, containing 2 . ^ Cac 
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aw Naci. p H 7.5. The bound protein may be 
guant.tatively recovered by elution with 2 M EDTA or 
hapten sugar in the presence of calcium ions. A 

The data set forth in the following table gives 
the percentage of recovered protein bound in the ' 

of from two to seven experiments. The threshold Ca 2 * 
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The 32K protein and the mixture of 10K ASP were 
isolated from canine lung as described above. While the 
addition of 60 ug of the 32K protein was able to 
enhance film formation by the "natural" phospholipid 
obtained from lung almost to the level exhibited by the 
complex per se, it only moderately enhanced film 
formation using synthetic lipid. Similar results were 
obtained for addition of 13 ug of the 10K protein 
alone. However, when 13 ug of the 10K preparation was 
incubated with the synthetic phospholipid vesicles prior 
to the addition of 60 ug of 32K protein, film 
formation occurred at a rate and to a degree comparable 
to that of the natural complex per se. These results 
are shown in Figure 10. 

The results for individual human and canine 
5 kd and 18k proteins are shown in Figures. 11 and 12 
Plotting surface pressure after 3 minutes (y axis) 
versus protein concentration (x axis). As shown in 
Figure 11. the maximum pressure attained is 40-45 mN/ra 
and either 5 kd or 18kd cause the spreading of lipids at 
about 10 ugs. This corresponds to a 

Phospholipid-to-protein ratio of 10:1 since 100 ug of 
lipid was used in all cases; the lipid mixture was 
DPPC:PG (7:3). but 8:2 and 9:1 ratios gave no 
significant difference in results. 

For the canine proteins shown in Figure 12 the 
results are identical to those for the human protein 
Figure 12 also shows the results of experiments in which 
recombinants produced 32K was added to the 18 kd or 5 
kd protein. The synergy between the proteins is shown 
m the circled dots. Ten ug 32 kd protein was added 
to 5 ug and 7.5 ug for 18 kd. and 7.5 and 11 ug 
for 5 kd protein. 
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Bovine 18 kd and 5 kd proteins gave identical 
results to the canine and human proteins. 

D-10. In Vivo Tests 

The control surface active material ( SAM) for 
in_vxvo testing was prepared as follows. Lungs of young 
adult rabbits are lavaged with saline. Healthy rabbits 
are anesthetized through the ear vein with 3 cc of 
sodium pentobarbital. The trachea is exposed and a 
3-way stopcock with a tube attached is inserted into the 
trachea and secured. The chest is opened, the chest 
walls are removed, and the pulmonary artery is 
catheterized with a size 8 feeding tube from the heart 
The circulation is flushed with 50 ml of normal saline 
while ventilating the lungs through the tracheal tube 
with a 60-ml syringe, and the lungs are then carefully 
removed with the trachea intact. sixty ml of normal 
saline are instilled into the" lungs through the tracheal 
tube, the lungs are then gently massaged for one minute, 
and the saline is withdrawn. Lavage is repeated four 
times, and the washings are pooled. Cell debris is 
removed from the lavage fluid at room temperature by 
centrifugation at 1000 x g for two hours. The pellet is 
suspended in 0.1 N saline plus 2 M CaCl at a final 
25 concentration of 10 mg/ml phospholipid." Concentration 
is adjusted by extracting the lipids with choloroform 
and methanol and measuring lipid phosphorus. m the 
bubble tensitometer this material give rapid adsorption 
(time constant 0.3 sec or less) and minimum surface 
B0 tensions of 0 to 3 mN/m on 50% reduction of area ' 
Maximum tension on expansion was 32 to 35 mN/ra. 

The subject and apparatus used for in vivo 
testing are as follows. Healthy, young, time-dated 
pregnant does are obtained from White Hare Rabbitory of 
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Missouri. At 21 or 22 days gestation the does are 
air-shipped and are checked upon arrival to assure that 
they are pregnant and healthy. Does are housed in 
standard large rabbit cages in the rabbit facility 
(1492-S) and are re-examined the day before use. 

Four plethysraographs were constructed with 
80inch lengths of 2-inch diameter acrylic cylinder to 
which are affixed a 3-inch long chimney of 1/2-inch 
acrylic tubing (id. 0.5 inch). The chimney is filled 
with enough cotton gauze to create a low resistance to 
air flow in and our of the plethysraograph. Flow in and 
out of the chamber is determined by measuring the 
differential pressure change between the inside of the 
plethysmograph and the room. (Time constant <0.1 
seconds). Leads are taken from the end of the main 
cylinder to a pressure transducer (Validyne DP45. 
Validyne Engineering Company. Northridge. CA) and to a 
calibrating syringe. When conducting experiments, the 
electrically integrated flow (volume) signal is 
frequently calibrated with the syringe. The other end 
of the main cylinder is sealed with a 2-inch rubber 
stopper through which were placed two 4-8nch metal rods 
and through which were pulled three ECG leads. Cotton 
sheeting is placed between the two metal rods forming a 
sling on which the experimental animal is placed. 
Bayonet-type electrodes are attached to the ECG leads. 
aN adapter is placed through the stopper so that the hub 
of the tracheal angiocath can be connected to a 
flow-through manifold which in turn is attached to the 
tubing from a respirator (Mark VIII. Bird Respirator 
Company. Palm Springs. CA) . The external deadspace of 
the airway is 0.05 to 0.07 ml. Airway pressure is 
measured in the manifold with an Alltech MSDICE/1 
transducer (Alltech. City of Industry. CA) . The 
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placed on the slings, and the ECG electrodes attached 
and the tracheotomy tube connected to an adapter 
connected to a respirator. The average elapsed time 
from delivery to the beginning of assisted ventilation 
is 10 minutes, maximum elapsed time is 15 minutes. 
Ventilation is begun with oxygen at a frequency of 48 
breaths/minute using an inspiratory time of 0.35 
seconds. For the first minute, the ventilatory settings 
are the same for all animals; inspiratory time 0.35 
seconds, peak inspiratory pressure 40 cmH z O. After 
the first minute the inspiratory pressure is adjusted to 
keep the tidal volume at 6.5 - 7.5 ml/kg. Animal weight 
is about 30 g so this is achieved with an absolute 
volume of about 0121 ml. The flow, tidal volume and 
airway pressure are recorded every five minutes for each 
of the three littermates. Animals are ventilated for 30 
minutes . 

Data from all animals in a set are rejected if 
one member develops an air leak or dies of other causes. 

After 30 minutes ventilation the tracheal tubes 
are closed with stopcocks and the lungs are allowed to 
degas for 10 minutes. Then each air-filled angiocath is 
connected to a horizontal, calibrated length of 5 mm 
plastic tubing containing 3 ml of air at the lung end 
and dyed water at the other end. The fluid-filled ends 
of three plastic tubes are connected via a manifold to a 
single reservoir of dyed water, whose surface is at the 
same level as the tubes. This reservoir can be raised 
in 50 cm water steps which correspondingly increase the 
pressure in the tubing and lungs. As the pressure 
increases or decreases, gas enters or leaves the lungs, 
displacing the fluid column, allowing measurement of the 
changes in gas volumes. This apparatus is similar to 
that described by Robertson. B. Lung (1980) 15_8:57-68. 
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The pressure is raised stepwise from 0 to 5. 10. 15. 20. 

25. 30 cmH 2 0. with a pause for one minute at each 

level before recording the volume change. after one 

minute at 20 mmH 2 0. the pressure is decreased by 5 

cmH z O decrements, again maintaining each pressure for 

one minute before recording the volume. Each volume 

measurement is corrected for compression. During the 

studies the animals are kept at 37°C by placing them in 

a water bath just below the surface. 

Data are obtained for P INS » compliance (C) 

and volume at specific pressures (Vp) . P,„„ is the 

INS 

pressure required to maintain a net lung volume; lower 

numbers, of course, indicate efficacy. Compliance, a 

measure of how easily the lungs are inflated, is also 

measured, and higher values are desired. Vp is the 

volume in cm of the lungs at the noted pressure in cm 

of water. The results are as follows. 

_ For P T . TO at 30 minutes, the results are as in 
• INS 

Table 1 (PL is phospholipid; 32K protein is human 32 kd 
ASP produced in CHO cells; 10K is a mixture of 5 kd. 8 
kd and 18 kd isolated native human proteins). 







Table 1 
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TREATMENT 
SAM 


n 
13 


p 

-INS 
18 + 4 




Saline (control) 


9 


31 + 1 




PL alone 


4 


32 + 1 




PL + 32K 


3 


28 + 5 




PL + 10K 
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17 + 2 


30 


PL + 10K (200:1) 
+ 32K (4:1) 


8 


20 + 5 




PL + 10K (200:1) 


5 


25 + 8 




PL + 18 kd (50:1) 




21. 22. 




PL + 5 kd (50:1) 




19 
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As shown in Table 1. 32K alone is minimally- 
effective, while the 10K mix or 5 kd or 18 kd proteins 
alone are reasonably effective. Addition of the 32K 
protein to the 10K mix. however, enhances the 
5 effectiveness. 

For compliance. Table 2 shows similar results. 



Table 2 
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TREATMENT 


n 


Compliance 


SAM 


13 


0 .441 + 0 . 113 


Saline (control) 
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0.243 + 0.025 


PL alone 
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0 .219 + 0.028 


PL + 32K 
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0.247 + 0.029 


PL + 10K 
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0.467 + 0.078 


PL + 10K (200:1) 


8 


0.401 + 0.041 



+ 32K (4:1) 



PL 
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10K (200:1) 
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0. 328 


+_ 


0. 


. 176 


PL 


+ 


18 kd (50:1) 




0.4 




0. 


.045 


PL 


+ 


5 kd (50:1) 




0.4 




0. 


. 045 



20 

Again the 10K mix or the 18 kd and 5 kd 
proteins show good activity, and while the 32K is much 
less effective, addition of the 32K protein greatly 
enhances activity of the 10K mix. 
25 Tables 3 and 4 show V 3Q and V 5 , (30 cm 

water and 5 cm water) respectively. 
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shown in Figure 15. The Ndel site of the starting 
plasraid is eliminated by digesting pKK233-2 with Ndel. 
blunting with Klenow, and religating. The Ndel-minus 
product was then digested with EcoRI and Pstl. and 
ligated to an EcoRI/PstI digest of the synthetic trp 
promoter of Figure 15 to obtain the desired vector. 
pTrp233. 

To prepare pBGal host vector. pTrp233 was 
digested with EcoRI. purified on a gel. and blunted with 
Klenow. The plasmid was relegated and amplified in E^ 
coli to give the corresponding plasmid lacking the EcoRI 
site. A synthetic oligonucleotide sequence encoding the 
amino terminus of B-galactosidase followed by 6 
threonine residues, 

bet a . B al ••' ( * hr)6 1 f3 , 

CO *TATGACCAT6ATTACG I AATTTAACCACCACCACCACCACCGAATTCATTA 

T21 .ACTOGTACTAA T6CTTAA. ATTGOT6OTG6T60TOGTGGCTTAA0TAATTC0A, L 

Nd .r ~ : Hindi II 

was ligated into Ndel/Hindlll digested intermediate 
plasmid. and plasmids containing the insert (pBGal host 
vector) identified by susceptibility to EcoRI cleavage. 

To construct pTrp-20. a portion of the SP-18 
cDNA #3. along with a synthetic fragment, was ligated 
into Ndel/Hindll digested pTrp233. The SP-18 fragment 
ligated into pUC-9 described above was excised by 
digesting with Pstl (cuts at nucleotide 694) and with 
HindTII (cuts past the 3' end in the plasmid 
polylinker ) . Two oligonucleotides were prepared, which, 
when annealed, encode the residues upstream of 
nucleotide 694 to the N-terminus (residue 201) and a 
preceding methionine (ATG) : 

TATGTTCCCCATTCCTCTCCCCTATTGCTGGCTCTGCA and 
GAGCCAGCAATAGGGAGAGGAATGGGGAACA. These oligonucleotides 
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were annealed, ligated to the excised cDNA. and inserted 
into the digested vector to obtain pTrp-20. 

To construct pBGal-20, an analogous procedure 
using EcoRI /Hindi I I digested pBGal host vector, 
5 Pstl/Hindll.I excised SP-18 DNA, and the filler 
nucleotides: 

AATTGAACGGTTTCCCCATTCCTCTCCCCTATTGCTGGCTCTGCA and 
GAGCCAGCAATAGGGGAGAGGAATGGGGAAACCGTTG. to give 
pBGal-20. 

10 Vectors for the expression of the gene encoding 

shorter forms of SP-18 were constructed from pTrp-20 or 
pBGal-20. To construct pTrp-9. pTrp-20 was cut with 
Ncol (nucleotide 846) and Hindlll, and rejoined with the 
annealed oligonucleotides CATGGATGACAGCGCTGGCCCAGGGTA 

15 and AGCTTACCTTGGGCCAGCGCTGTCATC . pBGal-9 was 

constructed in a completely analogous manner using 
pBGal-20 as starting material. 

To construct vectors encoding SP-5. . cDNA #18 
(Figure 5) was digested with Smal (nucleotide 94 - 

20 nucleotide 680) and the Smal-excised fragment inserted 
into the Smal site of pUC8. From the cloned gene, the 
fragment excised by digestion with ApaLI (nucleotide 
123) and Hindlll (linker) was ligated with Ndel/Hindlll 
digested pTrp-233 and the joining annealed nucleotides: 

2 5 TATGGGCATTCCCTGCTGCCCAG and TGCACTGGGCAGCAGGGAATGCCCA. 
to obtains pTrp-5. 

Similarly. pBGal-5 (N:G) and pBGal-5 (V8) were 
constructed using the same cDNA excised fragment, pBGal 
host vector cut with EcoRI and Hindlll, and the 

30 nucleotide pairs: AATTCAACGGCATTCCCTGCTGCCCAG and 
TGCACTGGGCAGCAGGGAATCCCGTTG ; and 

AATTCGGCATTCCCTGCTGCCCAG and TGCACTGGGCAGCAGGGAATGCCG , 
respectively. 



WO 37/06588 PCT/US6 



1. An alveolar surfactant protein (ASP) f 
of proteins normally accompanying it in situ, which 
selected from the group consisting of a protein enc 
by human SP-18 DNA and human SP-5 DNA including the 
processed forms thereof. 

2. The protein of claim 1 which is encode 
the DNA illustrated in Figure 2, Figure 3. Figure 5 
Figure 6. or the substantial equivalent thereof. 

3. A recombinant DNA sequence which encod 
the protein of claim 1. which DNA is free of DNA 
encoding proteins normally accompanying the protein 
claim 1. 

4. The DNA of claim 3 operably linked to 
control sequences effective in expressing said sequ 
in suitable recombinant host cells. 

5. Recombinant host cells transformed wit! 
DNA of claim 3 or the expression system of claim 4. 

6. A method of producing ASP which compri: 
culturing the cells of claim 5. 

7. Recombinant ASP produced by the method 

claim 6. 



8. A method to purify 32K ASP protein whi 
comprises subjecting a mixture containing said 32K 
protein to carbohydrate affinity chromatography usii 
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carbohydrate-bound support by (a) contacting the mixture 
with the carbohydrate-bound support under conditions 
wherein the. 32K protein is adsorbed to the 
carbohydrate-bound support; and (b) eluting the 32K 
protein under conditions wherein the protein is not 
bound . 

9. The method of claim 8 wherein the 
carbohydrate affinity is supplied by a mannose residue, 
the binding of the 32K protein to support is conducted 
in the presence of calcium ion. and the elution of said 
32K protein is under conditions which diminish calcium 
ion concentration. 

10. ASP protein purified by the method of 

claim 8. 

11. A pharmaceutical' composition effective in 
treating respiratory (RDS) in mammals which composition 
comprises the protein of claim 1 or claim 7 or claim 10 
in admixture with a phospholipid preparation and. 
optionally, with a pharmaceutical^ acceptable excipient. 

12. A pharmaceutical composition effective in 
treating RDS in mammals which composition comprises the 
protein of claim 1 or claim 7 or claim 10 in admixture 
with an effective amount of 32K ASP protein in admixture 
with a phospholipid preparation and. optionally, with a 
pharmaceutically acceptable excipient 
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His CLu Cys 



200 V— » 
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; TGT GAC GTT CTC CCC TTG AAG CTG CTC 
300 
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:? a a a a a a a s a a a a a a a a a a a a a a a a s 
' a a a a a a a a a a a a a a a a a a a a a a a a a a a a a ; 

■ The Val Leu Leu Leu Asp ; 



SSO 
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His RET pco Acg Leu Gin 
CAC ATG CCT CCG CTG CAG 



a a a a a a a a a a a a a a a as a a a a a a 

-h- «,« c 01. Al. 1«" «, Ala cy. J" £« J£ J£ ^ S3 «i i« a 
aSc AM ICC CAC CCC CTC COC CCC TCT "^JfJ «» "<= * CT C " " 

CAACCACT CAAACCCATC CCACCCCAAA CCACACCCAC TTCCTTCTCA COTOCACCCA ACCCACCAC* CTCTCCAOCA CATCCCCAAC AC< 
CTTC COCCCTCATA *JZcC ACAACTCACA CCCAAC«CA CCCACCCCAC CTCCCCTAAC CCCCCACCCC TCCCTCCACC CAACACATCA «CACATC« « 

cccrcc cttcatIc TTTCATCTCC ATCTCCATAA CACACTACCT tttacactta ^tcctaat actttcataa aacttaaatt caccacaata «c. 

1300 fid'iU. ( 1350 

ATGAAGTA CCGTAGAAGA TAGATATACT CAGAGCAAGCTT j 



183 

rco His Phe End 
CCT CAC TTC TGA 

1100 
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Lgure Human SP18 cONA 13 ._^~-nrv-,- att CCT CAG TCA CAC CTC CTC CAG TCC CTG CTC CTG CTG CTC CCC ACC 

/ GAATTOCCGTCCC ATC OCT CAC TCA CAC ^ ^ Trp ^ ^ ^ ^ L. u Pro The 

SSS5S£2S!!|££S = = S5S2SS£2SSS:SS!ESS 

200 



22 £ E s s ss s S ss s: ss s 2 ss ss 2 s 2: 2 s 2 - 2 2 as 2 2 ss 
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400 
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500 
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' 600 
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200 201 
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>c acc ccc cac err tga tgacaactcag ctctccagaa aaagacacct cctttaaSt cctccactat GGCCAGACAG TGCTCCCTCA CACCTCCAAT CCCACC 



lis Ser Pro Asp Leu End 
381 



\CCT TACCACCCCC ACCCAGCACC ATCC 



. '3 
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Exon I - • CTCCC ATC CCT GAG TCA CAC CTG CTG CAG TGG CTG CTG CTG CTG CTG CCC ACS CTC TCT GGC CCA CGC ACT G £tga. . 

MET Ala Clu Ser Bis Leu Leu Cln Trp Leu Leu Leu Leu Leu Pro Thr Leu Cys Cly Pro Gly Thr A 



TGG 


ACC 


ACC 


TCA 


TCC 


TTC 


CCC 


TCT 


CCC CAC 


CGC 


CCT CAC TTC TCC 


Trp 


Thr 


Thr 


Ser 


Ser 


Leu 


Ala 


cys 


Ala Gin 


cly 


Pro Glu Phe Trp 


CCC 


CTA 


CCC 


CAT 


TCC 


CTA 


CAC 


CAA 


CTC TGG 


CCA 


CAT CTC CCA CCC 


Ala 


Leu 


cly 


Bis 


Cys 


Leu 


Cln 


Clu 


Val Trp 


Cly 


Bis Val Cly Ala 



2052 

ccag CAT CAC CTA TGC CAA CAC TCT CAC CAC ATC CTC CAC ATC CTT AAC AAC ATC CCC AAG CAC CCC ATT TTC CAC gcaa. . 
Asp Asp Leu Cys Gin Clu Cys Clu Asp He Val Bis He Leu Asn Lys MET Ala Lys Clu Ala lie Phe Gin 



CAC 


ACC 


ATC 


ACC 


AAC 


TTC 


CTC 


CAC 


CAC 


CAC 


TGC 


AAC 


CTC 


CTC 


CCC 


Asp 


Thr 


MET 


Arg 


Lys 


Phe 


Leu 


Glu 


Cln 


Clu 


Cys 


Asn 


Val 


Leu 


Pro 


CTT 


CAC 


CAC 


TAC 


TTC 


CCC 


CTG 


CTC 


ATC 


CAC 


TAC 


TTC 


CAG 


AAC 


CAG 


Leu 


Asp 


Asp 


Tyr 


Phe 


Pro 


Leu 


Val 


lie 


Asp 


Tyr 


Phe 


Cln 


Asn 


Gin 



GAC 


TCA 


AAC 


CCC 


ATC 


TCT 


ATC CAC 


CTG 


GGC 


CTC 


TGC 


AAA 


Asp 


Ser 


Asn 


Gly 


He 


Cys 


MET Bis 


Leu 


Gly 


Leu Cys 


Lys 


GAC 


CCC 


CTC 


CCC 


AAA 


CCT 


CTG CGC 


CAC 


CCT 


CTC 


CCA 


CAC 


Asp 


Pro 


Leu 


Pro 


Lys 


Pro 


Leu Arg 


Asp 


Pro 


Leu 


Pro 


Asp 


CCC 


CTC 


CAG 


GCG 


AGG 


CCT 


GGG CCT 


CAC 


ACA 


CAC 


gtga... 


Ala 


Leu 


Gin 


Ala 


Arg 


Pro 


Gly Pro 


His 


Thr 


Gin 
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.ccag CAT CTC TCC GAG CAG CAA TTC CCC ATT CCT CTC CCC TAT TGC TGG CTC TGC AGG CCT CTG ATC AAG CGG ATC CAA CC 
Asp Leu Ser Glu Gin Gin Phe Pro He Pro Leu Pro Tyr Cys Trp Leu Cys Arg Ala Leu He Lys Arg lie Cln Al 

200 201 
ATG ATT CCC AAG -gtga... 
MET He Pro Lys 

4599 
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CTG 
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CTC CCC 
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He 
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Asp 
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He 


Pro Cln 
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CAA 
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CAG CAC 


ACC 
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Cys Lys 
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Val 


Glu 


Gin Bis 


Thr 






CAG gtac... 
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TCT 


GGG 


ACC ATG 


TCC 






Ala Leu Cly 


Val 


Cys 


Cly 


Thr MET 


Ser 
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pD6k-ll ^ 

50 

ATT CGG GGC AGC AAG ATG GAT GTG GGC AGC AAG GAG GTC TTG ATC GAG AGC CCG 
He Arg Gly Ser Lys MET Asp Val Gly Ser Lys Glu Val Leu He Glu Ser Pro 

100 

CCG GAC TAC TCA GCA GCT CCC CGG GGC CGG TTG GGC ATC CCC TGC TTC CCT TCG 
Pro Asp Tyr Ser Ala Ala Pro Arg Gly Arg Leu Gly He Pro. Cys Phe Pro Ser 

150 

TCC CTC AAA CGC CTG CTC ATC ATC GTA GTA GTG ATA GTC CTT GTG GTC GTG GTG 
Ser Leu Lys Arg Leu Leu He He Val Val Val He Val Leu Val Val Val Val 

200 

ATT GTC GGC GCT CTG CTA ATG GGT CTT CAC ATG AGC CAG AAA CAC ACT GAG ATG 
He Val Gly Ala Leu Leu MET Gly Leu His MET Ser Gin Lys His Thr Glu MET 

250 

GTC CTA GAG ATG AGC ATG GGG GGG CCA GAA GCC CAG CAG CGC CTG GCC CTG CAG 
Val Leu. Glu MET Ser MET Gly Gly Pro Glu Ala Gin Gin Arg Leu Ala Leu Gin 

300 

GAG CGT GTG GGC ACC ACT GCC ACC TTC TCC ATT GGC TCC ACT GGC ATC GTA GTG 
Glu Arg Val Gly Thr Thr Ala Thr Phe Ser He Gly Ser Thr Gly He Val Val 

350 

TAT GAC TAC CAG CGG CTC CTG ATT GCC TAT AAG CCA GCC CGG GGA ACC *TGT TGC 
Tyr Asp Tyr Gin Arg Leu Leu He Ala Tyr Lys Pro Ala Arg Gly Thr Cys Cys 

400. 

TAC ATC ATG AAG ATG ACT CCA GAG AAC ATC CCA AGT CTT GAG GCT CTC ACT AGA 
Tyr He MET Lys MET Thr Pro Glu Asn He Pro Ser Leu Glu Ala Leu Thr Arg 

" '' 450 

AAG TTT CAG GAC TTC CAG GTC AAG CCA GCC GTG TCT ACC TCT AAG CTG GGA CAG 
Lys Phe Gin Asp Phe Gin Val Lys Pro Ala Val Ser Thr Ser Lys Leu Gly Gin 

500 

GAG GAG GGC CAT GAT GCT GGC TCA GCA TCC CCT GGG GAT CCC CTG GAC TTC CTG 
Glu Glu Gly His Asp Ala Gly Ser Ala Ser Pro Gly Asp Pro Leu Asp Phe. Leu 

550 

GGC ACC ACA GTG AGC ACC CTG TGT GGT GAG GTG CCC CTC TTC TAC ATC TAG GAC 
Gly Thr Thr Val Ser Thr Leu Cys Gly Glu Val Pro Leu Phe Tyr He End 

600 650 
CCCTCA GGACCCACGG AGGCCCCAGG TGAGGAGGGA AGATCCACGC TCAAAGGGTC TTTGGCAGA 

700 

G AC G C GGG AAG ATGCTCCTGC CCACACCACG GGGACCAGCG CTGGCGAAAT GGGAGCTGTG GGG 

750 

AG AG GTG GGAGCGGGCA GGAGCTGCGG CTCCTGGGCA CACGGGGCTC CGACCACGAA AGAATAAA 
809 

GC AACCTGATTG CCCGAATTC 
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COMPARISON OF PSAP SEQUENCES 



10 20 30 40 50 60 
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'0 80 90 100 110 120 
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T N V R A 

T N V R P 



130 140 150 160 170 180 

TRGALSLQGS IMTVGEKVFS SNGQSITFDA IQEACARAGG RIAVPRNPEE NEAIASFVKK 



190 200 210 220 230 240 

YNTYAYVGLT EGPSPGDFRY SDGTPVNYTN WYRGEPAGRG KEQCVEMYTD GQWNDRNCLY gene 

pHSlO- 
6A 

pHSlO- 
1A 



SRLTICEF. gene 

pHS10-5 
6A 

pHSlO-4 
1A 
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